VACCINE COMPOSITIONS

This invention relates to DNA constructs, replicable
expression vectors containing the constructs, bacteria
containing the constructs and vaccines containing the
bacteria or fusion proteins expressed therefrom. More
particularly, the invention relates to novel DNA constructs
encoding the C fragment of tetanus toxin, and to fusion
proteins containing tetanus toxin C fragment.

It is known to prepare DNA constructs encoding two or
more heterologous proteins with a view to expressing the
proteins in a suitable host as a single fusion protein.
However, it has often been found that fusing two proteins
together in this way leads to an incorrectly folded
chimaeric protein which no longer retains the properties of
the individual components. For example, the B-subunits of
the Vibrio cholerae (CT-B) and E. coli (LT-B) enterotoxins
are powerful mucosal immunogens but genetic fusions to
these subunits can alter the structure and properties of
the carriers and hence their immunogenicity (see M.
Sandkvist et al., J. Bacteriol. 169, pp 4570-6, 1987;
Clements et al., 1990; and M. Lipscombe et al., Mol.
Microbiol. 5, pp 1385, 1990). Moreover, many heterologous
proteins expressed in bacteria are not produced in soluble,
properly folded or active forms and tend to accumulate as
insoluble aggregates (see C. Schein et al., Bio/Technology
6, pp 291-4, 1988 and R. Halenbeck et al., Bio/Technology 7,
pp 710-5, 1989).

In our earlier unpublished international patent
application PCT/GB93/01617, it is disclosed that by
providing a DNA sequence encoding tetanus toxin C fragment
(TetC) linked via a "hinge region" to a second sequence
encoding an antigen, the expression of the sequence in
bacterial cells is enhanced relative to constructs wherein
the C fragment is absent. For example, the expression
level of the full length P28 glutathione S-transferase
protein of S. mansoni when expressed as a fusion to TetC
from the nirB promoter was greater than when the P28
protein was expressed alone from the nirB promoter. The
TetC fusion to the full length P28 protein of S. mansoni
was soluble and expressed in both E. coli and S.
typhimurium. In addition, the TetC-P28 fusion protein was
capable of being affinity purified by a glutathione agarose
matrix, suggesting that the P28 had folded correctly to
adopt a conformation still capable of binding to its
natural substrate. It was previously considered that a
hinge region, which typically is a sequence encoding a high
proportion of proline and/or glycine amino acids, is
essential for promoting the independent folding of both the
TetC and the antigenic protein fused thereto. However, it
has now been discovered, surprisingly in view of the
previous studies on CT-B and LT-B referred to above, that
when the hinge region is omitted between the TetC and a
second antigen such as P28, the proteins making up the
fusion do exhibit correct folding as evidenced by affinity
purification on a glutathione agarose matrix.

Accordingly, in a first aspect, the invention provides
a DNA construct comprising a DNA sequence encoding a fusion
protein of the formula TetC-(Z)a-Het, wherein TetC is the C
fragment of tetanus toxin, or a protein comprising the
epitopes thereof; Het is a heterologous protein; Z is an
amino acid, and a is zero or a positive integer, provided
that (Z)a does not include the sequence Gly-Pro.

Typically (Z)a is a chain of 0 to 15 amino acids, for
example 0 to 10, preferably less than 6 and more preferably
less than 4 amino acids.

In one embodiment (Z)a is a chain of two or three amino
acids, the DNA sequence for which defines a restriction
endonuclease cleavage site.

In another embodiment, a is zero.

Usually the group (Z)a will not contain,
simultaneously, both glycine and proline, and generally
will not contain either glycine or proline at all.

In a further embodiment, (Z)a is a chain of amino acids
provided that when a is 6 or more, (Z)a does not contain
glycine or proline.

The group (Z)a may be a chain of amino acids
substantially devoid of biological activity.
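
The constraints placed on the linker (Z)a above can be expressed as a simple check. The following is a minimal sketch and not part of the original disclosure: the function names, the use of one-letter amino acid codes to write the linker, and the choice of which optional limits to test are all illustrative assumptions.

```python
# Illustrative sketch only: names and encoding are assumptions,
# not taken from the specification.
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids


def satisfies_first_aspect(linker: str) -> bool:
    """True if the linker (Z)a does not include the dipeptide Gly-Pro ("GP").

    An empty string corresponds to a = 0, i.e. a direct TetC-Het fusion.
    """
    linker = linker.upper()
    unknown = set(linker) - VALID_RESIDUES
    if unknown:
        raise ValueError(f"unrecognised residue codes: {sorted(unknown)}")
    return "GP" not in linker


def satisfies_further_embodiment(linker: str) -> bool:
    """True if, in addition, a linker of 6 or more residues has no Gly or Pro."""
    if not satisfies_first_aspect(linker):
        return False
    linker = linker.upper()
    if len(linker) >= 6:
        return "G" not in linker and "P" not in linker
    return True
```

Under these assumptions an empty linker (a = 0) and a short linker without the Gly-Pro dipeptide pass the first check, whereas any linker containing "GP" fails it.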

In a second aspect the invention provides a replicable
expression vector, for example suitable for use in
bacteria, containing a DNA construct as hereinbefore
defined.

In another aspect, the invention provides a host (e.g.
a bacterium) containing a DNA construct as hereinbefore
defined, the DNA construct being present in the host either
in the form of a replicable expression vector such as a
plasmid, or being present as part of the host chromosome,
or both.

In a further aspect, the invention provides a fusion 
protein of the form TetC-(Z)a-Het as hereinbefore defined,
preferably in substantially pure form, said fusion protein 
being expressible by a replicable expression vector as 
hereinbefore defined. 

In a further aspect the invention provides a process 
for the preparation of a bacterium (preferably an 
attenuated bacterium) which process comprises transforming 
a bacterium (e.g. an attenuated bacterium) with a DNA 
construct as hereinbefore defined. 

The invention also provides a vaccine composition
comprising an attenuated bacterium, or a fusion protein, as
hereinbefore defined, and a pharmaceutically acceptable
carrier.

The heterologous protein "Het" may for example be a 
heterologous antigenic sequence, e.g. an antigenic sequence 
derived from a virus, bacterium, fungus, yeast or parasite. 

Examples of viral antigenic sequences are sequences
derived from a type of human immunodeficiency virus (HIV)
such as HIV-1 or HIV-2, the CD4 receptor binding site from
HIV, for example from HIV-1 or -2, hepatitis A, B or C
virus, human rhinovirus such as type 2 or type 14, Herpes
simplex virus, poliovirus type 2 or 3, foot-and-mouth
disease virus (FMDV), rabies virus, rotavirus, influenza
virus, coxsackie virus, human papilloma virus (HPV), for
example the type 16 papilloma virus, the E7 protein
thereof, and fragments containing the E7 protein or its
epitopes; and simian immunodeficiency virus (SIV).

Examples of antigens derived from bacteria are those
derived from Bordetella pertussis (e.g. P69 protein and
filamentous haemagglutinin (FHA) antigens), Vibrio
cholerae, Bacillus anthracis, and E. coli antigens such as
E. coli heat labile toxin B subunit (LT-B), E. coli K88
antigens, and enterotoxigenic E. coli antigens. Other
examples of antigens include the cell surface antigen CD4,
Schistosoma mansoni P28 glutathione S-transferase antigens
(P28 antigens) and antigens of flukes, mycoplasma,
roundworms, tapeworms, Chlamydia trachomatis, and malaria
parasites, e.g. parasites of the genus Plasmodium or
Babesia, for example Plasmodium falciparum, and peptides
encoding immunogenic epitopes from the aforementioned
antigens.

Particular antigens include the full length
Schistosoma mansoni P28, and oligomers (e.g. 2, 4 and 8-
mers) of the immunogenic P28 aa 115-131 peptide (which
contains both a B and T cell epitope), and human papilloma
virus E7 protein, Herpes simplex antigens, foot and mouth
disease virus antigens and simian immunodeficiency virus
antigens.

The DNA constructs of the present invention may
contain a promoter whose activity is induced in response to
a change in the surrounding environment. An example of
such a promoter sequence is one which has activity which is
induced by anaerobic conditions. A particular example of
such a promoter sequence is the nirB promoter which has
been described, for example, in International Patent
Application PCT/GB92/00387. The nirB promoter has been
isolated from E. coli, where it directs expression of an
operon which includes the nitrite reductase gene nirB
(Jayaraman et al, J. Mol. Biol. 196, 781-788, 1987), and
nirD, nirC and cysG (Peakman et al, Eur. J. Biochem. 191,
315-323, 1990). It is regulated both by nitrite and by
changes in the oxygen tension of the environment, becoming
active when deprived of oxygen (Cole, Biochim. Biophys.
Acta 162, 356-368, 1968). Response to anaerobiosis is
mediated through the protein FNR, acting as a
transcriptional activator, in a mechanism common to many
anaerobic respiratory genes. By deletion and mutational
analysis the part of the promoter which responds solely to
anaerobiosis has been isolated and by comparison with other
anaerobically regulated promoters a consensus FNR-binding
site has been identified (Bell et al, Nucl. Acids Res. 17,
3865-3874, 1989; Jayaraman et al, Nucl. Acids Res. 17,
135-145, 1989). It has also been shown that the distance
between the putative FNR-binding site and the -10 homology
region is critical (Bell et al, Molec. Microbiol. 4, 1753-
1763, 1990). It is therefore preferred to use only that
part of the nirB promoter which responds solely to
anaerobiosis. As used herein, references to the nirB
promoter refer to the promoter itself or a part or
derivative thereof which is capable of promoting expression
of a coding sequence under anaerobic conditions. The
preferred sequence, which contains the nirB promoter,
is:

AATTCAGGTAAATTTGATGTACATCAAATGGTACCCCTTGCTGAATCGTTAAGG
TAGGCGGTAGGGCC (SEQ ID NO: 1)

In a most preferred aspect, the present invention
provides a DNA molecule comprising the nirB promoter
operably linked to a DNA sequence encoding a fusion protein
as hereinbefore defined.

In another preferred aspect of the invention, there is
provided a replicable expression vector, suitable for use
in bacteria, containing the nirB promoter sequence operably
linked to a DNA sequence encoding a fusion protein as
hereinbefore defined.

The DNA molecule or construct may be integrated into
the bacterial chromosome, e.g. by methods known per se, and
thus in a further aspect, the invention provides a
bacterium having in its chromosome a DNA sequence or
construct as hereinbefore defined.

Stable expression of the fusion protein can be
obtained in vivo. The fusion protein can be expressed in
an attenuated bacterium which can thus be used as a
vaccine.

The attenuated bacterium may be selected from the
genera Salmonella, Bordetella, Vibrio, Haemophilus,
Neisseria and Yersinia. Alternatively, the attenuated
bacterium may be an attenuated strain of enterotoxigenic
Escherichia coli. In particular the following species can
be mentioned: S. typhi - the cause of human typhoid;
S. typhimurium - the cause of salmonellosis in several
animal species; S. enteritidis - a cause of food poisoning
in humans; S. choleraesuis - a cause of salmonellosis in
pigs; Bordetella pertussis - the cause of whooping cough;
Haemophilus influenzae - a cause of meningitis; Neisseria
gonorrhoeae - the cause of gonorrhoea; and Yersinia - a cause
of food poisoning.

Examples of attenuated bacteria are disclosed in, for 
example EP-A-0322237 and EP-A-0400958, the disclosures in 
which are incorporated by reference herein. 

An attenuated bacterium containing a DNA construct
according to the invention, either present in the bacterial
chromosome, or in plasmid form, or both, can be used as a
vaccine. Fusion proteins (preferably in substantially pure
form) expressed by the bacteria can also be used in the
preparation of vaccines. For example, a purified TetC-P28
fusion protein in which the TetC protein is linked via its
C-terminus to the P28 protein with no intervening hinge
region has been found to be immunogenic on its own. In a
further aspect therefore, the invention provides a vaccine
composition comprising a pharmaceutically acceptable
carrier or diluent and, as active ingredient, an attenuated
bacterium or fusion protein as hereinbefore defined.

The vaccine may comprise one or more adjuvants.

The vaccine is advantageously presented in a
lyophilised form, for example in a capsular form, for oral
administration to a patient. Such capsules may be provided
with an enteric coating comprising, for example, Eudragit
"S", Eudragit "L", Cellulose acetate, Cellulose acetate
phthalate or Hydroxypropylmethyl Cellulose. These
capsules may be used as such, or alternatively, the
lyophilised material may be reconstituted prior to
administration, e.g. as a suspension. Reconstitution is
advantageously effected in buffer at a suitable pH to
ensure the viability of the organisms. In order to protect
the attenuated bacteria and the vaccine from gastric
acidity, a sodium bicarbonate preparation is advantageously
administered before each administration of the vaccine.
Alternatively, the vaccine may be prepared for parenteral
administration, intranasal administration or intramammary
administration.

The attenuated bacterium containing the DNA construct
or fusion protein of the invention may be used in the
prophylactic treatment of a host, particularly a human host
but also possibly an animal host. An infection caused by
a microorganism, especially a pathogen, may therefore be
prevented by administering an effective dose of an
attenuated bacterium according to the invention. The
bacterium then expresses the fusion protein which is
capable of raising antibody to the micro-organism. The
dosage employed will be dependent on various factors
including the size and weight of the host, the type of
vaccine formulated and the nature of the fusion protein.

An attenuated bacterium according to the present 
invention may be prepared by transforming an attenuated 
bacterium with a DNA construct as hereinbefore defined. 
Any suitable transformation technique may be employed, such
as electroporation. In this way, an attenuated bacterium
capable of expressing a protein or proteins heterologous to 
the bacterium may be obtained. A culture of the attenuated 
bacterium may be grown under aerobic conditions. A 
sufficient amount of the bacterium is thus prepared for 
formulation as a vaccine, with minimal expression of the 
fusion protein occurring. 

The DNA construct may be a replicable expression 
vector comprising the nirB promoter operably linked to a 
DNA sequence encoding the fusion protein. The nirB promoter 
may be inserted in an expression vector, which already 
incorporates a gene encoding one of the heterologous 
proteins (e.g. the tetanus toxin C fragment), in place of 
the existing promoter controlling expression of the 
protein. The gene encoding the other heterologous protein 
(e.g. an antigenic sequence) may then be inserted. The 
expression vector should, of course, be compatible with the 
attenuated bacterium into which the vector is to be 
inserted. 

The expression vector is provided with appropriate
transcriptional and translational control elements
including, besides the nirB promoter, a transcriptional
termination site and translational start and stop codons.
An appropriate ribosome binding site is provided. The
vector typically comprises an origin of replication and, if
desired, a selectable marker gene such as an antibiotic
resistance gene. The vector may be a plasmid.

The invention will now be illustrated, but not limited,
by reference to the following examples and the accompanying
drawings, in which:

Figure 1 is a schematic illustration of the
construction of plasmid pTECH1;

Figure 2 illustrates schematically the preparation of
the plasmid pTECH1-P28 from the starting materials pTECH1
and pUC19-P28;

Figure 3 illustrates schematically the preparation of
the plasmid pTECH3-P28 from the starting materials plasmids
pTECH1-P28 and pTETnir15;

Figures 4 and 5 are western blots obtained from
bacterial cells harbouring the pTECH3-P28 construct; and

Figure 6 illustrates the glutathione affinity
purification of TetC fusions as determined by SDS-PAGE and
Coomassie Blue staining.

In accordance with the invention a vector was
constructed to allow genetic fusions to the C terminus of
the highly immunogenic C fragment of tetanus toxin, without
the use of a heterologous hinge domain. A fusion was
constructed with the gene encoding the protective 28 kDa
glutathione S-transferase from Schistosoma mansoni. The
recombinant vector was transformed into Salmonella
typhimurium (SL5338; r-m+). The resulting chimeric protein
was stably expressed in a soluble form in Salmonella as
assessed by western blotting with fragment C and
glutathione S-transferase antisera. Furthermore it was
found that the P28 component of the fusion retains the
capacity to bind glutathione.

WO 95/04151 PCT/GB94/01647

The construction of the vector and the properties of
the fusion protein expressed therefrom are described in
more detail below.

EXAMPLE 1

Preparation of pTECH1

The preparation of pTECH1, a plasmid incorporating the
nirB promoter and TetC gene, and a DNA sequence encoding a
hinge region and containing restriction endonuclease sites
to allow insertion of a gene coding for a second or guest
protein, is illustrated in Figure 1. Expression plasmid
pTETnir15, the starting material shown in Figure 1, was
constructed from pTETtac115 (Makoff et al, Nucl. Acids Res.
17, 10191-10202, 1989) by replacing the EcoRI-ApaI region
(1354 bp) containing the lacI gene and tac promoter with the
following pair of oligos 1 and 2:

Oligo-1 5'-AATTCAGGTAAATTTGATGTACATCAAATGGTACCCCTTGCTGAAT
CGTTAAGGTAGGCGGTAGGGCC-3' (SEQ ID NO: 2)

Oligo-2 3'-GTCCATTTAAACTACATGTAGTTTACCATGGGGAACGACTTA
GCAATTCCATCCGCCATC-5' (SEQ ID NO: 3)

The oligonucleotides were synthesised on a Pharmacia
Gene Assembler and the resulting plasmids confirmed by
sequencing (Makoff et al, Bio/Technology 7, 1043-1046,
1989).
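
As a consistency check on the oligonucleotide pair, the two strands can be aligned computationally. The sketch below is illustrative only: it takes the sequences as given in SEQ ID NO: 2 and SEQ ID NO: 3 of the sequence listing, the helper names are our own, and reading the unpaired ends as EcoRI- and ApaI-compatible overhangs is an assumption based on the usual cleavage patterns of those enzymes rather than a statement made in the text.

```python
# Illustrative sketch; helper names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Oligo-1, written 5'->3' (SEQ ID NO: 2), in blocks of ten bases.
OLIGO_1 = "".join([
    "AATTCAGGTA", "AATTTGATGT", "ACATCAAATG", "GTACCCCTTG",
    "CTGAATCGTT", "AAGGTAGGCG", "GTAGGGCC",
])

# Oligo-2, written 3'->5' as printed in the text (SEQ ID NO: 3).
OLIGO_2_3TO5 = "".join([
    "GTCCATTTAA", "ACTACATGTA", "GTTTACCATG",
    "GGGAACGACT", "TAGCAATTCC", "ATCCGCCATC",
])


def anneal(top_5to3: str, bottom_3to5: str):
    """Align the bottom strand under the top strand.

    Returns (left_overhang, paired_region, right_overhang) of the top strand.
    Raises ValueError if the bottom strand is not fully complementary
    to one contiguous stretch of the top strand.
    """
    # Base-by-base complement of the 3'->5' strand reads 5'->3'.
    paired = bottom_3to5.translate(COMPLEMENT)
    start = top_5to3.find(paired)
    if start < 0:
        raise ValueError("strands are not complementary")
    end = start + len(paired)
    return top_5to3[:start], paired, top_5to3[end:]


left, paired, right = anneal(OLIGO_1, OLIGO_2_3TO5)
print(len(OLIGO_1), len(OLIGO_2_3TO5), len(paired))
print(left, right)
```

Run as written, this reports a 60-base paired region flanked on oligo-1 by a 5' AATT extension and a 3' GGCC extension, which is what would be expected of a duplex designed for ligation into a vector cut with the two enzymes named above.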

The pTETnir15 plasmid was then used for construction
of the pTECH1 plasmid incorporating a polylinker region
suitable as a site for insertion of heterologous DNA to
direct the expression of fragment C fusion proteins.
pTETnir15 is a known pAT153-based plasmid which directs the
expression of fragment C. However, there are no naturally
occurring convenient restriction sites present at the
3'-end of the TetC gene. Therefore, target sites, preceded
by a hinge region, were introduced at the 3'-end of the
TetC coding region by means of primers SEQ ID NO: 4 and SEQ
ID NO: 5 tailored with "add-on" adapter sequences (Table
1), using the polymerase chain reaction (PCR) [K. Mullis et
al, Cold Spring Harbor Symp. Quant. Biol. 51, 263-273, 1986].
Accordingly, pTETnir15 was used as a template in a PCR
reaction using primers corresponding to regions covering
the SacII and BamHI sites. The anti-sense primer in this
amplification was tailored with a 38 base 5'-adaptor
sequence. The anti-sense primer was designed so that a
sequence encoding novel XbaI, SpeI and BamHI sites was
incorporated into the PCR product. In addition, DNA
sequences encoding additional amino acids including
proline (the hinge region) and a translation stop codon
signal in frame with the fragment C open reading frame
were incorporated.

The PCR product was gel-purified and digested with
SacII and BamHI, and cloned into the residual 2.8 kb vector
pTETnir15 which had previously been digested with SacII and
BamHI. The resulting plasmid, purified from transformed
colonies and named pTECH1, is shown in Figure 1.
Heterologous sequences such as the sequence encoding the
Schistosoma mansoni P28 glutathione S-transferase (P28)
were cloned into the XbaI, SpeI and BamHI sites in
accordance with known methods.

The DNA sequence of the plasmid pTECH1 is shown in the
sequence listing as SEQ ID NO: 6.

TABLE 1 

DNA SEQUENCES OF OLIGONUCLEOTIDES UTILISED IN THE 
CONSTRUCTION OF THE TETC-HINGE VECTORS 

A) Primer 1. Sense PCR primer (21mer). (SEQ ID NO: 4)

                SacII
5'- AAA GAC TCC GCG GGC GAA GTT -3'
    TETANUS TOXIN C FRAGMENT SEQ.

B) Primer 2. Anti-Sense PCR Primer (64mer). (SEQ ID NO: 5)

         BamHI   STOP SpeI        XbaI    [illegible]
5'- CTAT GGA TCC TTA ACT AGT GAT TCT AGA [illegible]
    GTC GTT GGT CCA ACC TTC ATC GGT -3'
    TETANUS TOXIN C FRAGMENT SEQ. 3'-END
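
The positions of the restriction sites carried by the two primers can be confirmed with a short scan. The following is a sketch under stated assumptions: primer 1 is taken from SEQ ID NO: 4, only the legible 5' portion of primer 2 as we read it from Table 1 is used, the recognition sequences are the standard ones for the named enzymes, and the helper names are hypothetical.

```python
# Illustrative sketch; the site table and helper names are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

SITES = {
    "SacII": "CCGCGG",
    "BamHI": "GGATCC",
    "SpeI": "ACTAGT",
    "XbaI": "TCTAGA",
}

PRIMER_1 = "AAAGACTCCGCGGGCGAAGTT"                # SEQ ID NO: 4
PRIMER_2_5PRIME = "CTATGGATCCTTAACTAGTGATTCTAGA"  # legible 5' part only


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_sites(seq: str) -> dict:
    """Map enzyme name -> 0-based index of the first occurrence of its site.

    All four sites are palindromic, so scanning one strand is sufficient.
    """
    hits = {}
    for enzyme, site in SITES.items():
        pos = seq.find(site)
        if pos >= 0:
            hits[enzyme] = pos
    return hits


print(find_sites(PRIMER_1))
print(find_sites(PRIMER_2_5PRIME))
# The anti-sense triplet TTA after the BamHI site reads as a stop codon
# on the sense strand.
print(reverse_complement("TTA"))
```

On the anti-sense primer the sites occur in the order BamHI, SpeI, XbaI, so on the sense strand of the PCR product the order is XbaI, SpeI, stop codon, BamHI, consistent with the labels in Table 1.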

EXAMPLE 2 

Construction of pTECH1-P28

A P28 gene expression cassette was produced by PCR
using pUC19-P28 DNA (a kind gift from Dr R Pierce, Pasteur
Institute, Lille) as template. Oligonucleotide primers
were designed to amplify the full length P28 gene beginning
with the start codon and terminating with the stop codon.
In addition, the sense and antisense primers were tailored
with the restriction sites for XbaI and BamHI respectively.
The primers are shown in the sequence listing as SEQ ID NO:
7 and SEQ ID NO: 8.

The product was gel-purified and digested with XbaI and
BamHI and then cloned into pTECH1 which had previously been
digested with these enzymes and subsequently gel-purified.
The DNA sequence of pTECH1-P28 is shown in the sequence
listing as SEQ ID NO: 9.

Expression of the TetC-Hinge-P28 fusion protein

Several bacterial strains, namely S. typhimurium
strains SL5338 (A. Brown et al, J. Infect. Dis. 155, 86-92,
1987) and SL3261 and E. coli (TG2) were transformed with
pTECH1-P28 by means of electroporation. SL3261 strains
harbouring the pTECH1-P28 plasmid have been deposited at
the National Collection of Type Cultures, 61 Colindale
Avenue, London, NW9 5HT, UK under the accession number NCTC
12833. A strain of SL3261 containing the pTECH1 plasmid
has been deposited under accession number NCTC 12831. The
identity of recombinants was verified by restriction
mapping of the plasmid DNA harboured by the cells. Further,
expression of the TetC-P28 fusion protein was then
evaluated by SDS-PAGE and western blotting of bacterial
cells harbouring the construct. It was found that the
fusion protein remains soluble, cross-reacts with antisera
to both TetC and P28, and is also of the expected molecular
weight, 80 kDal, for a full length fusion.

The fusion protein was stably expressed in E. coli
(TG2) and S. typhimurium (SL5338, SL3261) as judged by
SDS-PAGE and western blotting. Of interest, a band of
50 kDal, which co-migrates with the TetC-Hinge protein alone
and cross-reacts exclusively with the anti-TetC sera, is
visible in a western blot. As the codon selection in the
hinge region has been designed to be suboptimal, the rare
codons may cause pauses during translation which may
occasionally lead to the premature termination of
translation, thus accounting for this band.
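
The expected size quoted for the full length fusion follows from adding the component masses. A minimal sketch, assuming nominal masses of about 50 kDal for the TetC-Hinge portion (the size of the band that co-migrates with TetC-Hinge alone) and 28 kDal for P28, and an arbitrary illustrative tolerance for reading band positions off a gel; the names and the tolerance are our own and are not taken from the text:

```python
# Nominal masses in kDal taken from the text; the tolerance is an assumed value.
TETC_HINGE_KDAL = 50.0
P28_KDAL = 28.0
GEL_TOLERANCE_KDAL = 5.0  # hypothetical allowance for SDS-PAGE size estimates


def expected_fusion_mass(*component_masses: float) -> float:
    """Approximate the mass of a fusion as the sum of its components."""
    return sum(component_masses)


def consistent_with_band(expected: float, observed: float,
                         tolerance: float = GEL_TOLERANCE_KDAL) -> bool:
    """True if an observed band lies within the tolerance of the expected mass."""
    return abs(expected - observed) <= tolerance


fusion = expected_fusion_mass(TETC_HINGE_KDAL, P28_KDAL)
print(fusion)
print(consistent_with_band(fusion, 80.0))  # band reported for the full fusion
print(consistent_with_band(fusion, 50.0))  # band attributed to TetC-Hinge alone
```

With these nominal values the sum is 78 kDal, which agrees with the roughly 80 kDal band within the assumed tolerance, while the 50 kDal band is clearly too small to be the full length fusion.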

Affinity purification of the TetC-P28 fusion

Glutathione is the natural substrate for P28, a
glutathione S-transferase. The amino acid residues
involved in binding glutathione are thought to be spatially
separated in the primary structure of the polypeptide and
brought together to form a glutathione binding pocket in
the tertiary structure (P. Reinemer et al, EMBO J. 8, 1997-
2005, 1991). In order to gauge whether the P28 component
of the fusion has folded correctly to adopt a conformation
capable of binding glutathione, its ability to be affinity
purified on a glutathione-agarose matrix was tested. The
results obtained (not shown) demonstrated that TetC-P28 can
indeed bind to the matrix and the binding is reversible, as
the fusion can be competitively eluted with free
glutathione.

EXAMPLE 3

Construction of pTECH3-P28

The plasmid pTECH1-P28 directs the expression of the
S. mansoni P28 protein as a C-terminal fusion to fragment
C from tetanus toxin separated by a heterologous hinge
domain. Expression of the fusion protein is under the
control of the nirB promoter. The vector pTECH3-P28 was in
part constructed from the plasmid pTETnir15 by the
polymerase chain reaction (PCR) using the high fidelity
thermostable DNA polymerase from Pyrococcus furiosus, which
possesses an associated 3'-5' exonuclease proofreading
activity. The sequence of steps is summarised in Figure 3.
In order to generate a TetC-hingeless replacement cassette,
the segment of DNA from the unique SacII site within the
TetC gene to the final codon was amplified by means of the
PCR reaction, using pTETnir15 as template DNA. The primers
used in the PCR amplification are shown in the sequence
listing as SEQ ID NO: 10 and SEQ ID NO: 11. The antisense
primer in this amplification reaction was tailored with an
XbaI recognition sequence.

The amplification reaction was performed according to
the manufacturer's instructions (Stratagene, La Jolla, CA,
USA). The product was gel-purified, digested with SacII
and XbaI, and then cloned into the residual pTECH1-P28
vector which had been previously digested with the
respective enzymes SacII and XbaI. The resulting vector
was designated pTECH3-P28. The DNA sequence of pTECH3-P28
is shown in the sequence listing as SEQ ID NO: 12.

EXAMPLE 4

Transformation of S. typhimurium SL5338 (galE r-m+) with
pTECH3-P28, and Analysis of the Transformants

S. typhimurium SL5338 (galE r-m+) were cultured in either L
or YT broth and on L-agar with ampicillin (50 µg/ml) if
appropriate and were transformed with the pTECH3-P28
plasmid. The transformation protocol was based on the
method described by MacLachlan and Sanderson (MacLachlan
PR and Sanderson KE, 1985. Transformation of Salmonella
typhimurium with plasmid DNA: differences between rough
and smooth strains. J. Bacteriology 161, 442-445).

A 1 ml overnight culture of S. typhimurium SL5338 (r-m+;
Brown A, Hormaeche CE, Demarco de Hormaeche R, Dougan G,
Winther M, Maskell D, and Stocker BAD, 1987. J.
Infect. Dis. 155, 86-92) was used to inoculate 100 ml of LB
broth and shaken at 37 °C until the culture reached OD650 =
0.2. The cells were harvested at 3000 x g and resuspended
in 0.5 volumes of ice-cold 0.1M MgCl2. The cells were
pelleted again and resuspended in 0.5 volumes of ice-cold
CaCl2. This step was repeated once more and the cells
resuspended in 1 ml of 0.1M CaCl2 to which was added 50 µl
of TES (50 mM Tris, 10 mM EDTA, 50 mM NaCl, pH 8.0). The
cells were incubated on ice for 45 to 90 minutes. To 150 µl
of cells was added 100 ng of plasmid DNA in 1-2 µl. The
mixture was incubated on ice for 30 minutes prior to
heat-shock at 42 °C for 2 minutes, and immediate reincubation
on ice for 1 minute. To the transformed mixture was added 2
ml of LB broth and incubated for 1.5 hours to allow
expression of β-lactamase. Following incubation, 20 µl and
200 µl of cells were spread on to LB agar plates containing
50 µg/ml of ampicillin. The plates were dried and incubated
at 37 °C overnight.

The identity of recombinants was verified by
restriction mapping of the plasmid DNA and by western
blotting with antisera directed against TetC and P28.

SDS-PAGE and Western Blotting

Expression of the TetC fusions was tested by SDS-PAGE
and western blotting. S. typhimurium SL5338 (galE r-m+)
bacterial cells containing the pTECH3-P28 plasmid and
growing in mid-log phase, with antibiotic selection, were
harvested by centrifugation and the proteins fractionated
by 10% SDS-PAGE. The proteins were transferred to a
nitrocellulose membrane by electroblotting and reacted with
either a polyclonal rabbit antiserum directed against TetC
or the full length P28 protein. The blots were then probed
with goat anti-rabbit Ig conjugated to horseradish
peroxidase (Dako, High Wycombe, Bucks, UK) and developed
with 4-chloro-1-naphthol. The results of the western
blotting experiments are shown in Figures 4 and 5; Figure
4 illustrates the results of probing with rabbit anti-TetC
polyclonal antiserum and Figure 5 illustrates the results
of probing with rabbit anti-P28 polyclonal antiserum. In
each case lanes 1, 2 and 3 are independent clones of SL5338
(pTECH3-P28), lanes 4, 5 and 6 are SL5338 (pTECH1-P28) and
lane 7 is SL5338 (pTETnir15). The molecular weight markers
are indicated. From the results, it is evident that the
fusion protein remains soluble, reacts with antisera to
both TetC and P28, and is also of the expected molecular
weight, 80 kDal, for a full length fusion (Figure 4).
Furthermore the fusion protein appears to be stably
expressed.


Glutathione-Agarose Affinity Purification

Glutathione is the natural substrate for P28, a
glutathione S-transferase. The amino acid residues
involved in binding glutathione are thought to be spatially
separated in the primary structure of the polypeptide and
brought together to form a glutathione binding pocket in
the tertiary structure. In order to gauge whether the P28
component of the fusion has folded correctly to adopt a
conformation capable of binding glutathione, we tested its
ability to be affinity purified on a glutathione agarose
matrix.

Bacterial cells containing pTECH3-P28 and expressing
the TetC full length P28 gene fusion were grown to log
phase, chilled on ice, and harvested by centrifugation at
2500 x g for 15 min at 4 °C. The cells were resuspended in
1/15th the original volume of ice-cold phosphate buffered
saline (PBS) and lysed by sonication in a MSE Soniprep 150
(Gallenkamp, Leicester, UK). The insoluble material was
removed by centrifugation and to the supernatant was added
1/6 volume of a 50% slurry of pre-swollen glutathione-agarose
beads (Sigma, Poole, Dorset, UK). After mixing
gently at room temperature for 1 hour the beads were
collected by centrifugation at 1000 x g for 10 secs. The
supernatant was discarded and the beads resuspended in 20
volumes of cold PBS-0.5% Triton X100 and the beads
collected again by centrifugation. The washing step was
repeated three more times. The fusion protein was eluted
by adding 1 volume of SDS-PAGE sample buffer. For
comparison purposes, a similar procedure was followed with
bacterial cells containing the pTECH1-P28 plasmid from
which TetC-hinge-P28 fusion protein is expressed. Extracts
from clones containing either plasmid were compared using
SDS-PAGE and the results are shown in Figure 6. In Figure
6, lanes 1, 2 and 3 are clones of SL5338 (pTECH1-P28)
whereas lanes 4, 5 and 6 are independent clones of SL5338
(pTECH3-P28).
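
The volume ratios in the purification procedure scale with the starting culture, and can be worked through for any culture volume. The sketch below is illustrative only. It assumes that the cleared supernatant has roughly the same volume as the resuspended cells, that a 50% slurry contains half its volume as settled beads, and that the "volumes" used for washing and elution are relative to the settled bead volume. These assumptions, and all names in the code, are ours and are not stated in the procedure.

```python
def purification_volumes(culture_ml: float) -> dict:
    """Scale the glutathione-agarose procedure to a given culture volume (ml).

    Assumptions (not stated in the text): supernatant volume is taken as
    equal to the PBS resuspension volume, and wash/elution "volumes" are
    multiples of the settled bead volume.
    """
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    resuspension = culture_ml / 15   # 1/15th of the original culture volume
    slurry = resuspension / 6        # 1/6 volume of 50% bead slurry
    beads = slurry * 0.5             # settled beads in a 50% slurry
    return {
        "resuspension_pbs_ml": resuspension,
        "bead_slurry_ml": slurry,
        "settled_beads_ml": beads,
        "wash_per_round_ml": 20 * beads,   # 20 volumes of PBS-0.5% Triton X100
        "wash_rounds": 4,                  # first wash plus three repeats
        "elution_sample_buffer_ml": beads, # 1 volume of SDS-PAGE sample buffer
    }


for name, value in purification_volumes(90.0).items():
    print(name, value)
```

For a hypothetical 90 ml culture this gives 6 ml of PBS for resuspension, 1 ml of slurry (0.5 ml of settled beads), four washes of 10 ml each and 0.5 ml of sample buffer for elution.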

The results suggest that the TetC-P28 fusion protein
can indeed bind to the matrix and the binding is reversible
regardless of the absence of a heterologous hinge domain
(data not shown). It is possible that a peptide sequence
present at the C terminus of TetC may in fact impart
flexibility to this particular region.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: MEDEVA HOLDINGS BV

(B) STREET: CHURCHILL-LAAN 223

(C) CITY: AMSTERDAM

(E) COUNTRY: THE NETHERLANDS

(F) POSTAL CODE (ZIP): 1078 ED

(ii) TITLE OF INVENTION: VACCINES

(iii) NUMBER OF SEQUENCES: 20 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/GB93/01617 

(B) FILING DATE: 30-JUL-1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9401787.8 

(B) FILING DATE: 31-JAN-1994 


(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 1..61 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AATTCAGGTA AATTTGATGT ACATCAAATG GTACCCCTTG CTGAATCGTT AAGGTAGGCG 60 
GTAGGGCC 68 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AATTCAGGTA AATTTGATGT ACATCAAATG GTACCCCTTG CTGAATCGTT AAGGTAGGCG 60 
GTAGGGCC 68 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCCATTTAA ACTACATGTA GTTTACCATG GGGAACGACT TAGCAATTCC ATCCGCCATC 60 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AAAGACTCCG CGGGCGAAGT T 21 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTATGGATCC TTAACTAGTG ATTCTAGAGG GCCCCGGCCC GTCGTTGGTC CAACCTTCAT 60 
CGGT 64 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTCAGGTAAA TTTGATGTAC ATCAAATGGT ACCCCTTGCT GAATCGTTAA GGTAGGCGGT 60 
AGGGCCCAGA TCTTAATCAT CCACAGGAGA CTTTCTGATG AAAAACCTTG ATTGTTGGGT 120 
CGACAACGAA GAAGACATCG ATGTTATCCT GAAAAAGTCT ACCATTCTGA ACTTGGACAT 180 
CAACAACGAT ATTATCTCCG ACATCTCTGG TTTCAACTCC TCTGTTATCA CATATCCAGA 240 
TGCTCAATTG GTGCCGGGCA TCAACGGCAA AGCTATCCAC CTGGTTAACA ACGAATCTTC 300 
TGAAGTTATC GTGCACAAGG CCATGGACAT CGAATACAAC GACATGTTCA ACAACTTCAC 360 
CGTTAGCTTC TGGCTGCGCG TTCCGAAAGT TTCTGCTTCC CACCTGGAAC AGTACGGCAC 420 
TAACGAGTAC TCCATCATCA GCTCTATGAA GAAACACTCC CTGTCCATCG GCTCTGGTTG 480 
GTCTGTTTCC CTGAAGGGTA ACAACCTGAT CTGGACTCTG AAAGACTCCG CGGGCGAAGT 540 
TCGTCAGATC ACTTTCCGCG ACCTGCCGGA CAAGTTGAAC GCGTACCTGG CTAACAAATG 600 
GGTTTTCATC ACTATCACTA ACGATCGTCT GTCTTCTGCT AACCTGTACA TCAACGGCGT 660 
TCTGATGGGC TCCGCTGAAA TCACTGGTCT GGGCGCTATC CGTGAGGACA ACAACATCAC 720 
TCTTAAGCTG GACCGTTGCA ACAACAACAA CCAGTACGTA TCCATCGACA AGTTCCGTAT 780 
CTTCTGCAAA GCACTGAACC CGAAAGAGAT CGAAAAACTG TATACCAGCT ACCTGTCTAT 840 
CACCTTCCTG CGTGACTTCT GGGGTAACCC GCTGCGTTAC GACACCGAAT ATTACCTGAT 900 
CCCGGTAGCT TCTAGCTCTA AAGACGTTCA GCTGAAAAAC ATCACTGACT ACATGTACCT 960 
GACCAACGCG CCGTCCTACA CTAACGGTAA ACTGAACATC TACTACCGAC GTCTGTACAA 1020 
CGGCCTGAAA TTCATCATCA AACGCTACAC TCCGAACAAC GAAATCGATT CTTTCGTTAA 1080 
ATCTGGTGAC TTCATCAAAC TGTACGTTTC TTACAACAAC AACGAACACA TCGTTGGTTA 1140 
CCCGAAAGAC GGTAACGCTT TCAACAACCT GGACAGAATT CTGCGTGTTG GTTACAACGC 1200 
TCCGGGTATC CCGCTGTACA AAAAAATGGA AGCTGTTAAA CTGCGTGACC TGAAAACCTA 1260 
CTCTGTTCAG CTGAAACTGT ACGACGACAA AAACGCTTCT CTGGGTCTGG TTGGTACCCA 1320 
CAACGGTCAG ATCGGTAACG ACCCGAACCG TGACATCCTG ATCGCTTCTA ACTGGTACTT 1380 
CAACCACCTG AAAGACAAAA TCCTGGGTTG CGACTGGTAC TTCGTTCCGA CCGATGAAGG 1440 
TTGGACCAAC GACGGGCCGG GGCCCTCTAG AATCACTAGT TAAGGATCCG CTAGCCCGCC 1500 
TAATGAGCGG GCTTTTTTTT CTCGGGCAGC GTTGGGTCCT GGCCACGGGT GCGCATGATC 1560 
GTGCTCCTGT CGTTGAGGAC CCGGCTAGGC TGGCGGGGTT GCCTTACTGG TTAGCAGAAT 1620 
GAATCACCGA TACGCGAGCG AACGTGAAGC GACTGCTGCT GCAAAACGTC TGCGACCTGA 1680 
GCAACAACAT GAATGGTCTT CGGTTTCCGT GTTTCGTAAA GTCTGGAAAC GCGGAAGTCA 1740 
GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG GTCGTTCGGC TGCGGCGAGC 1800 
GGTATCAGCT CACTCAAAGG CGGTAATACG GTTATCCACA GAATCAGGGG ATAACGCAGG 1860 
AAAGAACATG TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT 1920 
GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 1980 
GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT 2040 
CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 2100 
GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT 2160 
TCGCTCCAAG CTGGGCTGTG TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 2220 
CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC TTATCGCCAC TGGCAGCAGC 2280 
CACTGGTAAC AGGATTAGCA GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 2340 
GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT ATCTGCGCTC TGCTGAAGCC 2400 
AGTTACCTTC GGAAAAAGAG TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 2460 
CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA AAAAAAGGAT CTCAAGAAGA 2520 
TCCTTTGATC TTTTCTACGG GGTCTGACGC TCAGTGGAAC GAAAACTCAC GTTAAGGGAT 2580 
TTTGGTCATG AGATTATCAA AAAGGATCTT CACCTAGATC CTTTTAAATT AAAAATGAAG 2640 
TTTTAAATCA ATCTAAAGTA TATATGAGTA AACTTGGTCT GACAGTTACC AATGCTTAAT 2700 
CAGTGAGGCA CCTATCTCAG CGATCTGTCT ATTTCGTTCA TCCATAGTTG CCTGACTCCC 2760 
CGTCGTGTAG ATAACTACGA TACGGGAGGG CTTACCATCT GGCCCCAGTG CTGCAATGAT 2820 
ACCGCGAGAC CCACGCTCAC CGGCTCCAGA TTTATCAGCA ATAAACCAGC CAGCCGGAAG 2880 
GGCCGAGCGC AGAAGTGGTC CTGCAACTTT ATCCGCCTCC ATCCAGTCTA TTAATTGTTG 2940 
CCGGGAAGCT AGAGTAAGTA GTTCGCCAGT TAATAGTTTG CGCAACGTTG TTGCCATTGC 3000 
TGCAGGCATC GTGGTGTCAC GCTCGTCGTT TGGTATGGCT TCATTCAGCT CCGGTTCCCA 3060 
ACGATCAAGG CGAGTTACAT GATCCCCCAT GTTGTGCAAA AAAGCGGTTA GCTCCTTCGG 3120 
TCCTCCGATC GTTGTCAGAA GTAAGTTGGC CGCAGTGTTA TCACTCATGG TTATGGCAGC 3180 
ACTGCATAAT TCTCTTACTG TCATGCCATC CGTAAGATGC TTTTCTGTGA CTGGTGAGTA 3240 
CTCAACCAAG TCATTCTGAG AATAGTGTAT GCGGCGACCG AGTTGCTCTT GCCCGGCGTC 3300 
AACACGGGAT AATACCGCGC CACATAGCAG AACTTTAAAA GTGCTCATCA TTGGAAAACG 3360 
TTCTTCGGGG CGAAAACTCT CAAGGATCTT ACCGCTGTTG AGATCCAGTT CGATGTAACC 3420 
CACTCGTGCA CCCAACTGAT CTTCAGCATC TTTTACTTTC ACCAGCGTTT CTGGGTGAGC 3480 
AAAAACAGGA AGGCAAAATG CCGCAAAAAA GGGAATAAGG GCGACACGGA AATGTTGAAT 3540 
ACTCATACTC TTCCTTTTTC AATATTATTG AAGCATTTAT CAGGGTTATT GTCTCATGAG 3600 
CGGATACATA TTTGAATGTA TTTAGAAAAA TAAACAAATA GGGGTTCCGC GCACATTTCC 3660 
CCGAAAAGTG CCACCTGACG TCTAAGAAAC CATTATTATC ATGACATTAA CCTATAAAAA 3720 
TAGGCGTATC ACGAGGCCCT TTCGTCTTCA AGAA 3754 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TAGTCTAGAA TGGCTGGCGA GCATATCAAG 30 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TTAGGATCCT TAGAAGGGAG TTGCAGGCCT 30 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTCAGGTAAA TTTGATGTAC ATCAAATGGT ACCCCTTGCT GAATCGTTAA GGTAGGCGGT 60 

TET C GENE START CODON 
AGGGCCCAGA TCTTAATCAT CCACAGGAGA CTTTCTGATG AAAAACCTTG ATTGTTGGGT 120 
CGACAACGAA GAAGACATCG ATGTTATCCT GAAAAAGTCT ACCATTCTGA ACTTGGACAT 180 
CAACAACGAT ATTATCTCCG ACATCTCTGG TTTCAACTCC TCTGTTATCA CATATCCAGA 240 
TGCTCAATTG GTGCCGGGCA TCAACGGCAA AGCTATCCAC CTGGTTAACA ACGAATCTTC 300 
TGAAGTTATC GTGCACAAGG CCATGGACAT CGAATACAAC GACATGTTCA ACAACTTCAC 360 
CGTTAGCTTC TGGCTGCGCG TTCCGAAAGT TTCTGCTTCC CACCTGGAAC AGTACGGCAC 420 
TAACGAGTAC TCCATCATCA GCTCTATGAA GAAACACTCC CTGTCCATCG GCTCTGGTTG 480 

SacII 
GTCTGTTTCC CTGAAGGGTA ACAACCTGAT CTGGACTCTG AAAGACTCCG CGGGCGAAGT 540 
TCGTCAGATC ACTTTCCGCG ACCTGCCGGA CAAGTTCAAC GCGTACCTGG CTAACAAATG 600 
GGTTTTCATC ACTATCACTA ACGATCGTCT GTCTTCTGCT AACCTGTACA TCAACGGCGT 660 
TCTGATGGGC TCCGCTGAAA TCACTGGTCT GGGCGCTATC CGTGAGGACA ACAACATCAC 720 
TCTTAAGCTG GACCGTTGCA ACAACAACAA CCAGTACGTA TCCATCGACA AGTTCCGTAT 780 
CTTCTGCAAA GCACTGAACC CGAAAGAGAT CGAAAAACTG TATACCAGCT ACCTGTCTAT 840 
CACCTTCCTG CGTGACTTCT GGGGTAACCC GCTGCGTTAC GACACCGAAT ATTACCTGAT 900 
CCCGGTAGCT TCTAGCTCTA AAGACGTTCA GCTGAAAAAC ATCACTGACT ACATGTACCT 960 
GACCAACGCG CCGTCCTACA CTAACGGTAA ACTGAACATC TACTACCGAC GTCTGTACAA 1020 
CGGCCTGAAA TTGATCATCA AACGCTACAC TCCGAACAAC GAAATCGATT CTTTCGTTAA 1080 
ATCTGGTGAC TTCATCAAAC TGTACGTTTC TTACAACAAC AACGAACACA TCGTTGGTTA 1140 
CCCGAAAGAC GGTAACGCTT TCAACAACCT GGACAGAATT CTGCGTGTTG GTTACAACGC 1200 
TCCGGGTATC CCGCTGTACA AAAAAATGGA AGCTGTTAAA CTGCGTGACC TGAAAACCTA 1260 
CTCTGTTCAG CTGAAACTGT ACGACGACAA AAACGCTTCT CTGGGTCTGG TTGGTACCCA 1320 
CAACGGTCAG ATCGGTAACG ACCCGAACCG TGACATCCTG ATCGCTTCTA ACTGGTACTT 1380 
CAACCACCTG AAAGACAAAA TCCTGGGTTG CGACTGGTAC TTCGTTCCGA CCGATGAAGG 1440 

HINGE DOMAIN   XbaI   S. mansoni P28 GENE START 
TTGGACCAAC GACGGGCCGG GGCCCTCTAG AATGGCTGGC GAGCATATCA AGGTTATCTA 1500 
TTTTGACGGA CGCGGACGTG CTGAATCGAT TCGGATGACT CTTGTGGCAG CTGGTGTAGA 1560 
CTACGAAGAT GAGAGAATTA GTTTCCAAGA TTGGCCAAAA ATCAAACCAA CTATTCCAGA 1620 
CGGACGATTG CCTGCAGTGA AAGTCACTGA TGATCATGGG CACGTGAAAT GGATGTTAGA 1680 
GAGTTTGGCT ATTGCACGGT ATATGGCGAA GAAACATCAT ATGATGGGTG AAACAGACGA 1740 
GGAATACTAT AGTGTTGAAA AGTTGATTGG TCATGCTGAA GATGTAGAAC ATGAATATCA 1800 
CAAAACTTTG ATGAAGCCAC AAGAAGAGAA AGAGAAGATA ACCAAAGAGA TATTGAACGG 1860 
CAAAGTTCCA GTTCTTCTCA ATATGATCTG CGAATCTCTG AAAGGGTCGA CAGGAAAGCT 1920 
GGCTGTTGGG GACAAAGTAA CTCTAGCTGA TTTAGTCCTG ATTGCTGTCA TTGATCATGT 1980 
GACTGATCTG GATAAAGGAT TTCTAACTGG CAAGTATCCT GAGATCCATA AACATCGAGA 2040 
AAATCTGTTA GCCAGTTCAC CGCGTTTGGC GAAATATTTA TCGAACAGGC CTGCAACTCC 2100 

STOP BamHI 
CTTCTAAGGA TCCGCTAGCC CGCCTAATGA GCGGGCTTTT TTTTCTCGGG CAGCGTTGGG 2160 
TCCTGGCCAC GGGTGCGCAT GATCGTGCTC CTGTCGTTGA GGACCCGGCT AGGCTGGCGG 2220 


GGTTGCCTTA CTGGTTAGCA GAATGAATCA CCGATACGCG AGCGAACGTG AAGCGACTGC 2280 
TGCTGCAAAA CGTCTGCGAC CTGAGCAACA ACATGAATGG TCTTCGGTTT CCGTGTTTCG 2340 
TAAAGTCTGG AAACGCGGAA GTCAGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 2400 
CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 2460 
CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 2520 

GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA 2580 
TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 2640 
GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG 2700 
ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG 2760 
GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT 2820 
TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA 2880 
CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 2940 
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT 3000 
TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC 3060 
CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG 3120 
CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 3180 
GAACGAAAAC TGACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA 3240 
GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG 3300 
GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG 3360 
TTCATCCATA GTTGCCTGAC TCGCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC 3420 
ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC 3480 
AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC 3540 
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG 3600 
TTTGCGCAAC GTTGTTGCCA TTGCTGCAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT 3660 
GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG 3720 
CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT 3780 
GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG 3840 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG 3900 
ACCGAGTTGC TCTTGCCCGG CGTCAACACG GGATAATACC GCGCCACATA GCAGAACTTT 3960 
AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT 4020 
GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC 4080 
TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT 4140 
AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT 4200 
TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA 4260 
AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GACGTCTAAG AAACCATTAT 4320 
TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG CCCTTTCGTC TTCAAGAA 4378 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AAAGACTCCG CGGGCGAAGT T 21 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTATCTAGAG TCGTTGGTCC AACCTTCATC 30 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTCAGGTAAA TTTGATGTAC ATCAAATGGT ACCCCTTGCT GAATCGTTAA GGTAGGCGGT 60 

TET C GENE START CODON 
AGGGCCCAGA TCTTAATCAT CCACAGGAGA CTTTCTGATG AAAAACCTTG ATTGTTGGGT 120 
CGACAACGAA GAAGACATCG ATGTTATCCT GAAAAAGTCT ACCATTCTGA ACTTGGACAT 180 
CAACAACGAT ATTATCTCCG ACATCTCTGG TTTCAACTCC TCTGTTATCA CATATCCAGA 240 
TGCTCAATTG GTGCCGGGCA TCAACGGCAA AGCTATCCAC CTGGTTAACA ACGAATCTTC 300 
TGAAGTTATC GTGCACAAGG CCATGGACAT CGAATACAAC GACATGTTCA ACAACTTCAC 360 
CGTTAGCTTC TGGCTGCGCG TTCCGAAAGT TTCTGCTTCC CACCTGGAAC AGTACGGCAC 420 
TAACGAGTAC TCCATCATCA GCTCTATGAA GAAACACTCC CTGTCCATCG GCTCTGGTTG 480 

SacII 
GTCTGTTTCC CTGAAGGGTA ACAACCTGAT CTGGACTCTG AAAGACTCCG CGGGCGAAGT 540 
TCGTCAGATC ACTTTCCGCG ACCTGCCGGA CAAGTTCAAC GCGTACCTGG CTAACAAATG 600 
GGTTTTCATC ACTATCACTA ACGATCGTCT GTCTTCTGCT AACCTGTACA TCAACGGCGT 660 
TCTGATGGGC TCCGCTGAAA TCACTGGTCT GGGCGCTATC CGTGAGGACA ACAACATCAC 720 
TCTTAAGCTG GACCGTTGCA ACAACAACAA CCAGTACGTA TCCATCGACA AGTTCCGTAT 780 
CTTCTGCAAA GCACTGAACC CGAAAGAGAT CGAAAAACTG TATACCAGCT ACCTGTCTAT 840 
CACCTTCCTG CGTGACTTCT GGGGTAACCC GCTGCGTTAC GACACCGAAT ATTACCTGAT 900 
CCCGGTAGCT TCTAGCTCTA AAGACGTTCA GCTGAAAAAC ATCACTGACT ACATGTACCT 960 
GACCAACGCG CCGTCCTACA CTAACGGTAA ACTGAACATC TACTACCGAC GTCTGTACAA 1020 
CGGCCTGAAA TTCATCATCA AACGCTACAC TCCGAACAAC GAAATCGATT CTTTCGTTAA 1080 
ATCTGGTGAC TTCATCAAAC TGTACGTTTC TTACAACAAC AACGAACACA TCGTTGGTTA 1140 
CCCGAAAGAC GGTAACGCTT TCAACAACCT GGACAGAATT CTGCGTGTTG GTTACAACGC 1200 
TCCGGGTATC CCGCTGTACA AAAAAATGGA AGCTGTTAAA CTGCGTGACC TGAAAACCTA 1260 
CTCTGTTCAG CTGAAACTGT ACGACGACAA AAACGCTTCT CTGGGTCTGG TTGGTACCCA 1320 
CAACGGTCAG ATCGGTAACG ACCCGAACCG TGACATCCTG ATCGCTTCTA ACTGGTACTT 1380 
CAACCACCTG AAAGACAAAA TCCTGGGTTG CGACTGGTAC TTCGTTCCGA CCGATGAAGG 1440 



XbaI   S. mansoni P28 GENE START 
TTGGACCAAC GACTCTAGAA TGGCTGGCGA GCATATCAAG GTTATCTATT TTGACGGACG 1500 
CGGACGTGCT GAATCGATTC GGATGACTCT TGTGGCAGCT GGTGTAGACT ACGAAGATGA 1560 
GAGAATTAGT TTCCAAGATT GGCCAAAAAT CAAACCAACT ATTCCAGACG GACGATTGCC 1620 
TGCAGTGAAA GTCACTGATG ATCATGGGCA CGTGAAATGG ATGTTAGAGA GTTTGGCTAT 1680 
TGCACGGTAT ATGGCGAAGA AACATCATAT GATGGGTGAA ACAGACGAGG AATACTATAG 1740 
TGTTGAAAAG TTGATTGGTC ATGCTGAAGA TGTAGAACAT GAATATCACA AAACTTTGAT 1800 
GAAGCCACAA GAAGAGAAAG AGAAGATAAC CAAAGAGATA TTGAACGGCA AAGTTCCAGT 1860 
TCTTCTCAAT ATGATCTGCG AATCTCTGAA AGGGTCGACA GGAAAGCTGG CTGTTGGGGA 1920 
CAAAGTAACT CTAGCTGATT TAGTCCTGAT TGCTGTCATT GATCATGTGA CTGATCTGGA 1980 
TAAAGGATTT CTAACTGGCA AGTATCCTGA GATCCATAAA CATCGAGAAA ATCTGTTAGC 2040 

STOP BamHI 
CAGTTCACCG CGTTTGGCGA AATATTTATC GAACAGGCCT GCAACTCCCT TCTAAGGATC 2100 
CGCTAGCCCG CCTAATGAGC GGGCTTTTTT TTCTCGGGCA GCGTTGGGTC CTGGCCACGG 2160 
GTGCGCATGA TCGTGCTCCT GTCGTTGAGG ACCCGGCTAG GCTGGCGGGG TTGCCTTACT 2220 
GGTTAGCAGA ATGAATCACC GATACGCGAG CGAACGTGAA GCGACTGCTG CTGCAAAACG 2280 
TCTGCGACCT GAGCAACAAC ATGAATGGTC TTCGGTTTCC GTGTTTCGTA AAGTCTGGAA 2340 
ACGCGGAAGT CAGCGCTCTT CCGCTTGCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 2400 
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG 2460 
GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA 2520 
GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG 2580 
ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 2640 
TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 2700 
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATGTCAGTTC 2760 
GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG 2820 
CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 2880 
ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA 2940 
GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 3000 
TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC 3060 
CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG 3120 
ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC 3180 
ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA 3240 
TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 3300 
CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT 3360 
TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG 3420 
TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 3480 
GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC 3540 
TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 3600 
TGTTGCCATT GCTGCAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG 3660 
CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT 3720 
TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 3780 
GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT 3840 
GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 3900 
TTGCCCGGCG TCAACACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT 3960 
CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG 4020 
TTCGATGTAA CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT 4080 
TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG 4140 
GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 4200 
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC 4260 
GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA ACCATTATTA TCATGACATT 4320 
AACCTATAAA AATAGGCGTA TCACGAGGCC CTTTCGTCTT CAAGAA 4366 

CLAIMS 



1. A DNA construct comprising a DNA sequence encoding a 
fusion protein of the formula TetC-(Z)a-Het, wherein 
TetC is the C fragment of tetanus toxin, or a protein 
comprising the epitopes thereof; Het is a heterologous 
protein, Z is an amino acid, and a is zero or a 
positive integer, provided that (Z)a does not include 
the sequence Gly-Pro. 

2. A DNA construct according to Claim 1 wherein (Z)a is a 
chain of 0 to 15 amino acids. 

3. A DNA construct according to Claim 2 wherein (Z)a is a 
chain of less than 4 amino acids. 

4. A DNA construct according to Claim 3 wherein (Z)a is a 
chain of two or three amino acids, the DNA sequence 
for which defines a restriction endonuclease cleavage 
site. 

5. A DNA construct according to Claim 2 wherein a is 

6. A DNA construct according to Claim 2 in which (Z)a is 
free from glycine and/or proline. 

7. A DNA construct according to any one of the preceding 
Claims wherein the heterologous protein Het is an 
antigenic sequence derived from a virus, bacterium, 
fungus, yeast or parasite. 

8. A DNA construct according to Claim 7 wherein the 
heterologous protein Het is the Schistosoma mansoni 
P28 glutathione S-transferase antigen. 

9. A replicable expression vector, for example suitable 
for use in bacteria, containing a DNA construct as 
defined in any one of Claims 1 to 8. 

10. A host, for example, a bacterium, having integrated 
into the chromosomal DNA thereof a DNA construct as 
defined in any one of Claims 1 to 8. 

11. A fusion protein as defined in any one of Claims 1 to 

12. A process for the preparation of a bacterium 
(preferably an attenuated bacterium), which process 
comprises transforming a bacterium with a DNA 
construct as defined in any one of Claims 1 to 8. 

13. A vaccine composition comprising a fusion protein, or 
an attenuated bacterium expressing said fusion 
protein, the fusion protein being as defined in any 
one of Claims 1 to 8; and a pharmaceutically 
acceptable carrier. 

WO 95/04151    PCT/GB94/01647 



14. A method of immunising a patient, e.g. a human 



patient, which comprises administering to the patient 



tiding amount of a vaccine 



composition as defined in Claim 13. 
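
The constraints that Claims 1 and 2 place on the linker (Z)a can be illustrated with a short check. This is only a sketch in Python and not part of the claimed subject matter: the function name is hypothetical, the linker is assumed to be written in one-letter amino acid codes, and the two example linkers ("SR" and "GPGPSR") are a reading of the TetC/P28 junctions in SEQ ID NO: 12 and SEQ ID NO: 9 respectively, translated in frame with the TetC start codon.

```python
def linker_within_claims_1_and_2(linker: str, max_len: int = 15) -> bool:
    """Check a candidate (Z)a linker given in one-letter amino acid codes.

    An empty string stands for a = 0 (direct TetC-Het fusion).
    Claim 1: the linker must not contain the sequence Gly-Pro ("GP").
    Claim 2: the linker is a chain of 0 to 15 amino acids.
    """
    seq = linker.upper()
    if len(seq) > max_len:
        return False
    return "GP" not in seq


if __name__ == "__main__":
    for candidate in ["", "SR", "GPGPSR"]:
        print(repr(candidate), linker_within_claims_1_and_2(candidate))
```

Under these assumptions the two-residue linker encoded by the XbaI site passes, whereas a linker containing the Gly-Pro hinge does not.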